A minimum classification error based distance measure for template based speech recognition
Abstract
In this paper we investigate the minimum classification error (MCE) criterion for the training of distance measures for template based speech recognition. These MCE-based distance measures are illustrated with example experiments on the Wall Street Journal 5k benchmark for continuous speech recognition.
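The abstract does not spell out the loss or the distance measure, so the following is only a minimal sketch of how an MCE-style criterion is commonly set up for a trainable distance, not the authors' formulation. It assumes a diagonal-weighted squared Euclidean distance, nearest-template classification, and a sigmoid-smoothed error that uses only the single best competing template; the names `weighted_sq_distance`, `mce_loss`, and `alpha` are hypothetical.

```python
import numpy as np


def weighted_sq_distance(x, template, weights):
    """Diagonal-weighted squared Euclidean distance (assumed form, not from the paper)."""
    diff = x - template
    return float(np.sum(weights * diff * diff))


def mce_loss(x, label, templates, template_labels, weights, alpha=1.0):
    """Smoothed MCE-style loss for one sample under nearest-template classification."""
    dists = np.array([weighted_sq_distance(x, t, weights) for t in templates])
    same = template_labels == label
    d_correct = dists[same].min()      # closest template of the correct class
    d_competing = dists[~same].min()   # closest template of any other class
    # Misclassification measure: negative when the sample is classified correctly.
    misclassification = d_correct - d_competing
    # The sigmoid replaces the 0/1 error count with a differentiable value in (0, 1).
    return 1.0 / (1.0 + np.exp(-alpha * misclassification))


# Toy data: one template per class, unit weights.
templates = np.array([[0.0, 0.0], [2.0, 0.0]])
template_labels = np.array([0, 1])
weights = np.ones(2)

loss_correct = mce_loss(np.array([0.1, 0.0]), 0, templates, template_labels, weights)
loss_wrong = mce_loss(np.array([0.1, 0.0]), 1, templates, template_labels, weights)
```

In this sketch the loss falls below 0.5 when the closest template has the correct label and rises above 0.5 otherwise, so minimizing its average over the training data with respect to `weights` pushes the distance toward fewer classification errors.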
Similar resources
Voice-based Age and Gender Recognition using Training Generative Sparse Model
Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper, a new gender and age recognition system is introduced, based on generative incoherent models learned using sparse non-negative matrix factorization and an atom correction post-processing method. Similar to genera...
A discriminative locally weighted distance measure for speaker independent template based speech recognition
In template based speech recognition, there is a need for a high-performing distance measure between speech frames. Some well-known metrics include the Euclidean and the Mahalanobis distance. The recent tendency is to perform a local scaling of the distance metric, defining a set of classes and computing a set of weights for each of these classes. Discriminative training approaches have already...
Minimum classification error training in example based speech and pattern recognition using sparse weight matrices
The Minimum Classification Error (MCE) criterion is a well-known criterion in pattern classification systems. The aim of MCE training is to minimize the resulting classification error when trying to classify a new data set. Usually, these classification systems use some form of statistical model to describe the data. These systems usually do not work very well when this underlying model is incor...
A new information retrieval method suited to texts produced by speech recognition
In this article, a pre-processing method is introduced that is applicable to the retrieval of speech-recognized texts. The inputs are a text corpus generated by a speech recognition system and a query; the task is to search these documents for the query and find the relevant ones. A basic problem with typical speech-recognized text is a certain percentage of recognition errors. This results erroneously ass...
Speaker Normalization for Improved Automatic Speech Recognition for Digital Libraries
Wei Wang, Old Dominion University, 2004. Director: Dr. Stephen A. Zahorian. The context of the thesis work is the improvement of automatic speech recognition (ASR) for use with digital libraries. First, commonly used multimedia file formats and codecs are surveyed with the objective of identifying those formats t...